Written
Corpus,
Language Type:
Multilingual
Languages:
Dutch
Availability:
From Owner
License:
Creative Commons BY-NC-SA 3.0
Size:
305000 tokens
Production Status:
Newly created-in progress
Use:
Document Classification, Text categorisation
- Paper title: CLiPS Stylometry Investigation (CSI) corpus: A Dutch corpus for the detection of age, gender, personality, sentiment and deception in text
- Paper track: Evaluation
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Ben Verhoeven | CLiPS, University of Antwerp | BE |
| Author 2 | Walter Daelemans | University of Antwerp, CLiPS | BE |
| Main Contact | Ben Verhoeven | CLiPS, University of Antwerp | None |
Documentation:
Documentation publicly available in English
Language Type:
Multilingual
Languages:
Dutch
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Newly created-finished
Use:
Acquisition
- Paper title: The Dutch LESLLA Corpus
- Paper track: Speech
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Eric Sanders | CLS/CLST, Radboud University Nijmegen | NL |
| Author 2 | Ineke van de Craats | CLS, Radboud University Nijmegen | NL |
| Author 3 | Vanja de Lint | CLS/CLST, Radboud University Nijmegen | NL |
| Main Contact | Eric Sanders | CLS/CLST, Radboud University Nijmegen | None |
Documentation:
<Not Specified>
Written
Text-to-Speech Synthesizer,
Language Type:
Multilingual
Languages:
Dutch
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Web Services
- Paper title: Speech Recognition Web Services for Dutch
- Paper track: Speech
- Paper status: Accept Poster+Demo
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Joris Pelemans | KU Leuven | BE |
| Author 2 | Kris Demuynck | Ghent University | BE |
| Author 3 | Hugo Van hamme | <Not Specified> | BE |
| Author 4 | Patrick Wambacq | KU Leuven | BE |
| Main Contact | Joris Pelemans | KU Leuven | None |
Documentation:
<Not Specified>
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
Chinese Dutch Finnish French German Greek Hungarian Japanese Russian Spanish
Availability:
Freely Available
License:
Apache-2.0
Size:
None
Production Status:
Existing-used
Use:
Speech Synthesis
- Paper title: One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech
- Paper track: 7.14 Cross-lingual and multilingual aspects in spe/Poster Presentation
- Paper status: Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Tomáš Nekvinda | CSS10 | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
Chinese Dutch French German Russian
Availability:
Freely Available
License:
Creative Commons CC0
Size:
None
Production Status:
Existing-updated
Use:
Speech Synthesis
- Paper title: One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech
- Paper track: 7.14 Cross-lingual and multilingual aspects in spe/Poster Presentation
- Paper status: Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Tomáš Nekvinda | Cleaned Common Voice | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Multilingual
Languages:
Dari/Pashto Dutch English Finnish French Hindi Icelandic Indonesian Japanese Lithuanian Malay Mandarin Nepali Portuguese Punjabi Romanian Slovenian Spanish
Availability:
From Owner
License:
CreativeCommons
Size:
467 hours
Production Status:
Newly created-finished
Use:
Person Identification
- Paper title: JukeBox: A Multilingual Singer Recognition Dataset
- Paper track: 4.3 Speaker verification and identification/Oral Presentation
- Paper status: Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Anurag Chowdhury | JukeBox | /N |
Documentation:
Documentation in English language will be made available upon publication of the dataset.
Written
Terminology,
Language Type:
Multilingual
Languages:
Arabic Dutch English French German Modern Greek Russian Spanish
Availability:
Freely Available
License:
Size:
4473 concepts
Production Status:
Existing-updated
Use:
Acquisition
- Paper title: Representing Multiword Term Variation in a Terminological Knowledge Base: a Corpus-Based Study
- Paper track: Terminology/oral presentation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Pilar León-Araúz | EcoLexicon | /N |
Documentation:
https://ecolexicon.ugr.es/en/manual.htm
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
Chinese Dutch French German Italian Mongolian Persian Russian Spanish Swedish Turkish
Availability:
Freely Available
License:
CC0
Size:
700 hours
Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: CoVoST: A Diverse Multilingual Speech-To-Text Translation Corpus
- Paper track: Speech/oral presentation
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Changhan Wang | CoVoST | /N |
Documentation:
https://github.com/facebookresearch/covost
Written
Lexicon,
Language Type:
Multilingual
Languages:
Albanian Arabic Basque Bulgarian Catalan Chinese Croatian Danish Dutch English Finnish French Galician Greek Hebrew Icelandic Indonesian Italian Japanese Lithuanian Malay Norwegian Persian Polish Portuguese Romanian Slovak Slovene Spanish Swedish Thai
Availability:
Freely Available
License:
Multiple Licenses
Size:
1072646 synsets
Production Status:
Existing-used
Use:
All of the above
- Paper title: Some Issues with Building a Multilingual Wordnet
- Paper track: Infrastructural Issues/Large Projects/oral presentation
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | John P. McCrae | Open Multilingual WordNet | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
Dutch English French German Spanish
Availability:
From Owner
License:
<Not Specified>
Size:
334.4 MByte
Production Status:
Existing-used
Use:
Corpus Creation/Annotation
- Paper title: A Multilingual, Multi-style and Multi-granularity Dataset for Cross-language Textual Similarity Detection
- Paper track: Evaluation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Jérémy Ferrero | Université Grenoble Alpes | FR |
| Author 2 | Frédéric Agnès | Compilatio | FR |
| Author 3 | Laurent Besacier | LIG | FR |
| Author 4 | Didier Schwab | Univ. Grenoble Alpes | FR |
| Main Contact | Jérémy Ferrero | Université Grenoble Alpes | None |
Documentation:
<Not Specified>




